AITopics | hyperparameter optimization method

hyperparameter optimization method

Scalable Nested Optimization for Deep Learning

Lorraine, Jonathan

arXiv.org Machine Learning, Jul-1-2024

Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this: bilevel or nested optimization, in which subsets of parameters are updated on different objectives nested inside each other. We focus on the motivating examples of hyperparameter optimization and generative adversarial networks. However, naively applying classical methods often fails when solving these nested problems at large scale. In this thesis, we build tools for nested optimization that scale to deep learning setups.
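As a rough illustration of the nested structure described in this abstract, the sketch below tunes one regularization strength on a toy one-dimensional ridge regression. The inner loop fits a weight to the training loss, and the outer loop updates the hyperparameter on the validation loss using a hypergradient from the implicit function theorem. The function names, the toy data, and the step sizes are all assumptions made for this example; it is not the thesis's method and says nothing about how the thesis scales this idea.

```python
def inner_solve(lam, train, w=0.0, steps=200, lr=0.01):
    """Inner problem: approximately minimize the training loss over w.

    Training loss: 0.5 * sum((w*x - y)^2) + 0.5 * lam * w^2.
    """
    for _ in range(steps):
        grad = sum(x * (w * x - y) for x, y in train) + lam * w
        w -= lr * grad
    return w


def hypergradient(w, lam, train, val):
    """d(validation loss)/d(lam), assuming w is (near) the inner optimum."""
    dval_dw = sum(x * (w * x - y) for x, y in val)
    hessian = sum(x * x for x, _ in train) + lam  # d^2 L_train / dw^2
    mixed = w                                     # d^2 L_train / (dw dlam)
    dw_dlam = -mixed / hessian                    # implicit function theorem
    return dval_dw * dw_dlam


def tune(train, val, lam=1.0, outer_steps=100, outer_lr=1.0):
    """Outer problem: gradient descent on lam using the validation loss."""
    w = 0.0
    for _ in range(outer_steps):
        w = inner_solve(lam, train, w)  # warm-start from the previous w
        lam = max(0.0, lam - outer_lr * hypergradient(w, lam, train, val))
    return lam, w


# Hypothetical toy data: noisy training targets, clean validation targets.
train = [(1.0, 2.5), (2.0, 3.5), (3.0, 6.5)]
val = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
lam, w = tune(train, val)
print(f"lambda = {lam:.3f}, w = {w:.3f}")
```

On this toy problem the inner solution has a closed form, so the outer loop can be checked by hand: the validation loss is minimized when the fitted weight equals 2, which requires a regularization strength of 0.5.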

hyperparameter optimization algorithm, max 10-step lyapunov exponent, scalable nested optimization, (15 more...)

arXiv: 2407.01526

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(7 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Leisure & Entertainment > Games (1.00)
Education (0.67)
Health & Medicine (0.67)
Energy (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Domhan

AAAI Conferences, Feb-8-2022, 12:37:47 GMT

Deep neural networks (DNNs) show very strong performance on many machine learning problems, but they are very sensitive to the setting of their hyperparameters. Automated hyperparameter optimization methods have recently been shown to yield settings competitive with those found by human experts, but their widespread adoption is hampered by the fact that they require more computational resources than human experts. Humans have one advantage: when they evaluate a poor hyperparameter setting they can quickly detect (after a few SGD steps) that the resulting network performs poorly and terminate the corresponding evaluation to save time. Here, we mimic this early termination of bad runs based on a probabilistic model that extrapolates performance from the first part of a learning curve. Experiments with different neural network architectures show that our resulting approach speeds up state-of-the-art hyperparameter optimization methods for DNNs roughly twofold, enabling them to find DNN settings that yield better performance than those chosen by human experts.
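The early-termination idea in this abstract can be sketched in a much simplified form. The abstract describes a probabilistic model that extrapolates the learning curve; the sketch below swaps in a single saturating power law fitted by least squares and a point estimate, so it illustrates only the control flow, not the authors' model. The curve family, the exponent grid, and all function names are assumptions made for this example.

```python
def fit_power_law(curve):
    """Fit y(t) = a - b * t**(-c) to accuracies observed at epochs 1..n.

    For a fixed exponent c the model is linear in (a, b), so we grid over c
    and solve the remaining least-squares problem in closed form.
    """
    n = len(curve)
    if n < 3:
        raise ValueError("need at least 3 points to fit the curve")
    y_mean = sum(curve) / n
    best = None
    for k in range(1, 41):  # exponent grid: c = 0.05, 0.10, ..., 2.0
        c = 0.05 * k
        z = [t ** (-c) for t in range(1, n + 1)]
        z_mean = sum(z) / n
        var = sum((zi - z_mean) ** 2 for zi in z)
        cov = sum((zi - z_mean) * (yi - y_mean) for zi, yi in zip(z, curve))
        b = -cov / var            # slope of y on z is -b
        a = y_mean + b * z_mean   # intercept
        err = sum((a - b * zi - yi) ** 2 for zi, yi in zip(z, curve))
        if best is None or err < best[0]:
            best = (err, a, b, c)
    return best[1], best[2], best[3]


def predict_final(curve, horizon):
    """Extrapolate the fitted curve to the final epoch `horizon`."""
    a, b, c = fit_power_law(curve)
    return a - b * horizon ** (-c)


def should_terminate(curve, horizon, best_final):
    """Stop a run whose extrapolated final accuracy is below the best so far."""
    return predict_final(curve, horizon) < best_final


# Hypothetical partial learning curves (validation accuracy per epoch).
weak_run = [0.5 - 0.3 * t ** -0.5 for t in range(1, 11)]
strong_run = [0.9 - 0.5 * t ** -0.5 for t in range(1, 11)]
print(should_terminate(weak_run, horizon=100, best_final=0.8))
print(should_terminate(strong_run, horizon=100, best_final=0.8))
```

A point estimate like this ignores uncertainty in the extrapolation, so a practical version would stop a run only when the predicted probability of beating the best result so far is small.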

artificial intelligence, human expert, machine learning, (2 more...)

Industry: Education > Focused Education > Special Education (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

5 Hyperparameter Optimization Methods Every Data Scientist Should Use

#artificialintelligence, Mar-19-2021, 01:05:10 GMT

Before starting the quest for our best model, we first need a dataset and a model. We chose the Amazon US Reviews dataset. The goal is to predict its target feature (the number of stars awarded) from customer reviews. Below, we define the model whose hyperparameters we will try to optimize. If you're not familiar with pipelines, don't hesitate to check out our previous article! Before we get to the optimization part, we first need to know what our model's hyperparameters are, right?

grid search, hyperparameter, hyperparameter optimization method, (15 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.41)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

Weighted Random Search for CNN Hyperparameter Optimization

Andonie, Razvan; Florea, Adrian-Catalin

arXiv.org Machine Learning, Mar-30-2020

Nearly all algorithms used in machine learning have two different sets of parameters: the training parameters and the meta-parameters (hyperparameters). While the training parameters are learned during the training phase, the values of the hyperparameters have to be specified before learning starts. For a given dataset, we would like to find the optimal combination of hyperparameter values in a reasonable amount of time. This is a challenging task because of its computational complexity. In previous work [11], we introduced the Weighted Random Search (WRS) method, a combination of Random Search (RS) and a probabilistic greedy heuristic. In the current paper, we compare the WRS method with several state-of-the-art hyperparameter optimization methods with respect to Convolutional Neural Network (CNN) hyperparameter optimization. The criterion is the classification accuracy achieved within the same number of tested combinations of hyperparameter values. According to our experiments, the WRS algorithm outperforms the other methods.
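One way to read "a combination of Random Search and a probabilistic greedy heuristic" is sketched below: at each trial, every hyperparameter is either resampled at random with its own probability of change or kept at its best value so far. The abstract does not say how those probabilities are chosen, so the sketch takes them as an input. The function name, the search space, the toy objective, and the fallback that resamples one hyperparameter when none was selected are all assumptions made for this example, not the authors' algorithm.

```python
import random


def weighted_random_search(objective, space, change_prob, n_trials, seed=0):
    """Maximize `objective` within a fixed budget of tested combinations.

    space:       dict name -> function(rng) that samples one value
    change_prob: dict name -> probability of resampling that hyperparameter
    """
    rng = random.Random(seed)
    names = list(space)
    best = {name: space[name](rng) for name in names}  # first trial: plain RS
    best_score = objective(best)
    for _ in range(n_trials - 1):
        candidate = dict(best)  # greedy part: start from the best so far
        changed = [n for n in names if rng.random() < change_prob[n]]
        if not changed:
            # Assumption: never spend a trial on an unchanged combination.
            changed = [rng.choice(names)]
        for name in changed:
            candidate[name] = space[name](rng)  # random part: resample
        score = objective(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


# Hypothetical search space and stand-in for a CNN's validation accuracy.
space = {
    "lr": lambda rng: rng.uniform(0.0, 1.0),
    "layers": lambda rng: rng.randint(1, 6),
}
change_prob = {"lr": 1.0, "layers": 0.3}


def toy_accuracy(cfg):
    return 1.0 - (cfg["lr"] - 0.1) ** 2 - 0.01 * (cfg["layers"] - 3) ** 2


best, score = weighted_random_search(toy_accuracy, space, change_prob, 50)
print(best, round(score, 4))
```

Setting every probability of change to 1.0 recovers plain random search, which makes it easy to compare the two under the same number of tested combinations.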

hyperparameter, hyperparameter optimization, optimization, (16 more...)

doi: 10.15837/ijccc.2020.2.3868

2003.133

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > Canada > Ontario > Toronto (0.04)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.91)
